RECENT ADVANCES IN IMAGE AND VIDEO RETRIEVAL Semantic classification of movie scenes using finite state machines
نویسندگان
چکیده
The problem of classifying scenes from feature films into semantic categories is addressed and a robust framework for this problem is proposed. It is proposed that the finite state machines (FSM) are suitable for detecting and classifying scenes and their usage is demonstrated for three types of movie scenes: conversation, suspense and action. This framework utilises the structural information of the scenes together with the low-level and mid-level features. Low level features of the video including motion and audio energy and a mid-level feature, body, are used in this approach. The transitions of the FSMs are determined by the features from each shot in the scene. The FSMs have been experimented on over 80 clips and convincing results have been achieved.
منابع مشابه
Recent Advances in Video Content Analysis: From Visual Features to Semantic Video Segments
This paper addresses the problem of automatically partitioning a video into semantic segments using visual low-level features only. Semantic segments may be understood as building content blocks of a video with a clear sequential content structure. Examples are reports in a news program, episodes in a movie, scenes of a situation comedy or topic segments of a documentary. In some video genres l...
متن کاملSemantic Image and Video Indexing in Broad Domains
Image and video collections are abundant, but where for document retrieval standard products are widely available, access to image and video collections is still cumbersome. The reason is the semantic gap between what we can derive automatically from the visual data and the semantic interpretation a user has of the same data. To bring semantic access, bridges across the semantic gap have to be ...
متن کاملWhere to Play: Retrieval of Video Segments using Natural-Language Queries
In this paper, we propose a new approach for retrieval of video segments using natural language queries. Unlike most previous approaches such as concept-based methods or rule-based structured models, the proposed method uses image captioning model to construct sentential queries for visual information. In detail, our approach exploits multiple captions generated by visual features in each image...
متن کاملAutomatic road crack detection and classification using image processing techniques, machine learning and integrated models in urban areas: A novel image binarization technique
The quality of the road pavement has always been one of the major concerns for governments around the world. Cracks in the asphalt are one of the most common road tensions that generally threaten the safety of roads and highways. In recent years, automated inspection methods such as image and video processing have been considered due to the high cost and error of manual metho...
متن کاملآشکارسازی و تعیین مکان متون فارسی - عربی در تصاویر ویدیویی
Video text detection plays an important role in applications such as semantic-based video analysis, text information retrieval, archiving and so on. In this paper, we propose a Farsi/Arabic text detection approach. First, with an appropriate edge detector, edges are extracted and then by using edges cross ponts, artificial corners are extracted. Artificial corner histogram analysis is done for ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2000